Fast and Reliable Object Classification in Video Based on a 3d Generic Model
نویسندگان
چکیده
We propose a new object classification approach for monocular video sequences, which allows to classify objects modelled independently from the camera position and object orientation. To achieve this independence, a simple 3D object model that represents an object as a parallelepiped is proposed. The approach is able to give good estimates of object dimensions and proposes visual reliability measures for the object dimensions. These measures give a representation of the visibility of the estimated dimension and are principally proposed to aid posterior phases of the video understanding process, as object tracking and event detection. The method obtains the 3D parallelepiped model estimation using a set of 2D moving regions (obtained in a segmentation phase), the perspective matrix transform (obtained from camera calibration using the pin-hole camera model) and predefined 3D models of expected objects in the scene. After classification, a merging step is performed to improve the classification performance by assembling 2D moving regions with better 3D model probability when together. This approach shows promising results on object classification, obtaining very high detection rates for complex situations and performing at video frame rate.
منابع مشابه
3D Scene and Object Classification Based on Information Complexity of Depth Data
In this paper the problem of 3D scene and object classification from depth data is addressed. In contrast to high-dimensional feature-based representation, the depth data is described in a low dimensional space. In order to remedy the curse of dimensionality problem, the depth data is described by a sparse model over a learned dictionary. Exploiting the algorithmic information theory, a new def...
متن کاملSegmentation Assisted Object Distinction for Direct Volume Rendering
Ray Casting is a direct volume rendering technique for visualizing 3D arrays of sampled data. It has vital applications in medical and biological imaging. Nevertheless, it is inherently open to cluttered classification results. It suffers from overlapping transfer function values and lacks a sufficiently powerful voxel parsing mechanism for object distinction. In this work, we are proposing an ...
متن کاملFast Intra Mode Decision for Depth Map coding in 3D-HEVC Standard
three dimensional- high efficiency video coding (3D-HEVC) is the expanded version of the latest video compression standard, namely high efficiency video coding (HEVC), which is used to compress 3D videos. 3D videos include texture video and depth map. Since the statistical characteristics of depth maps are different from those of texture videos, new tools have been added to the HEVC standard fo...
متن کاملObject-Based Classification of UltraCamD Imagery for Identification of Tree Species in the Mixed Planted Forest
This study is a contribution to assess the high resolution digital aerial imagery for semi-automatic analysis of tree species identification. To maximize the benefit of such data, the object-based classification was conducted in a mixed forest plantation. Two subsets of an UltraCam D image were geometrically corrected using aero-triangulation method. Some appropriate transformations were perfor...
متن کامل3D Models Recognition in Fourier Domain Using Compression of the Spherical Mesh up to the Models Surface
Representing 3D models in diverse fields have automatically paved the way of storing, indexing, classifying, and retrieving 3D objects. Classification and retrieval of 3D models demand that the 3D models represent in a way to capture the local and global shape specifications of the object. This requires establishing a 3D descriptor or signature that summarizes the pivotal shape properties of th...
متن کامل